Abstract

For settings with a binary treatment and a binary outcome, instrumental
variables can be used to construct bounds on a causal treatment effect. With
continuous outcomes, meaningful bounds are more difficult to obtain because
the domain of the outcome is typically unrestricted. In this paper, we combine
an instrumental variable (IV) and subjective assumptions in the context of an
observational cohort study of HIV-infected women to construct meaningful bounds
on the initial-stage causal effect of antiretroviral therapy on CD4 count. The
subjective assumptions are encoded in terms of the potential outcomes that are
identified by observed data as well as a sensitivity parameter that captures the
impact of unmeasured confounding. Measured confounding is adjusted using the
method of inverse probability weighting (IPW). With extra information from an
IV, we quantify both the causal treatment effect and the degree of the
unmeasured confounding. We demonstrate our method by analyzing data from the HIV
Epidemiology Research Study.

1 Introduction



Observational studies offer an important alternative to randomized clinical trials when
assigning a treatment to study subjects is unethical or practically impossible (Rosenbaum
2002). However, analyzing data from such studies often confronts the difficulty that
a direct comparison between the treated and untreated subjects does not necessarily
show the causal effect of the treatment, because of confounding. Informally, confounding
is caused by factors that have causal effects on both treatment and outcome (see
VanderWeele and Shpitser 2011 for more rigorous discussions and definitions of
confounding and confounders). To adjust for confounding, observational studies
typically collect a set of covariates in the hope that most, if not all, confounders
are measured. Statistical methods for controlling for confounding under the
assumption of no unmeasured confounding include multivariate adjustment via
regression models, propensity score risk adjustment, propensity score matching, and
inverse probability weighting (IPW) (Bang and Robins 2005; Kang and Schafer 2007;
Robins et al. 1994; Robins et al. 2000; Rosenbaum and Rubin 1984; Hogan and
Lancaster 2004; D'Agostino 1998, among others).

The assumption of no unmeasured confounding is untestable in general and very
often implausible. When it fails, it becomes imperative both to make causal inference
on the treatment effect and to obtain a quantitative measure of the degree of
unmeasured confounding. The two objectives are generally not achievable in a single
observational study, but they become possible given an instrumental variable (IV).



IV methods can be traced back to the 1920s (Stock and Trebbi 2003; Wright
1928), and have been extensively implemented in econometric and, more recently,
biomedical research. Loosely speaking, an IV can be envisioned as a 'randomizer' which varies
exogenously, has a causal effect on treatment received, but has no causal effect on the
outcome except through treatment. By convention, these three conditions are referred
to as the exogeneity, monotonicity, and exclusion restriction assumptions, respectively
(Angrist et al. 1996). When a valid IV exists, it can be used to draw inference about the
causal treatment effect despite the existence of unmeasured confounding. However, the
IV estimate of the treatment effect applies only to a specific non-identifiable
subpopulation unless additional assumptions are made (Angrist et al. 1996; Imbens and
Angrist 1994). In simple settings with a binary treatment and a binary outcome, IV methods
can also be used to construct bounds on a population causal treatment effect (Robins 1989;
Manski 1990; Joffe 2001), so that
the uncertainty about the impact of unmeasured confounding is accounted for by a bound
(in contrast to a point) estimate. With continuous outcomes, meaningful bounds are
not straightforward to obtain because the domain of the outcome is typically
unrestricted.

In this paper, we consider the case of an observational study with a continuous
outcome and a valid IV. We propose to combine the IV and subjective assumptions,
in the context of the HIV Epidemiology Research Study (HERS) (Smith et al. 1997),
to construct meaningful bounds on (1) the population average treatment effect (ATE)
and (2) the degree of unmeasured confounding. The HERS was conducted when
highly active antiretroviral therapy (HAART) first became available to HIV-infected
patients but was not randomly assigned. Of particular interest was the initial-stage
causal effect of HAART on a patient's CD4+ T lymphocyte (CD4) count, an important
immunological marker of immune system function and disease stage. The study was
conducted at two types of study site, community clinics and academic health centers,
and we use the site type as an IV. The study collected an extensive set of covariates which
can be used to adjust for part of the confounding, but unmeasured confounding
may still exist and the magnitude of its impact is unclear. To give a sense of its
impact, many HIV-positive individuals in the early HAART era were reluctant to initiate
therapy due to fear of adverse side effects and toxicity. At the same time, physicians
tended to prescribe HAART to patients in poor health, particularly those with
low CD4 counts. These factors were not fully measured and could
confound the HAART effect in a non-negligible way.

To account for the measured confounding, we implement the inverse probability
weighting method proposed by Robins et al. (1994), assuming that each subject
has a probability strictly between 0 and 1 of receiving HAART. In the ideal case when
unmeasured confounding is absent, the IPW method can consistently estimate the ATE, and
can be augmented to achieve double robustness (see Bang and Robins 2005). With
unmeasured confounding, we adopt the method of Robins et al. (1999) and incorporate
a sensitivity parameter into the IPW estimating equations to capture the effect
of unmeasured confounding. The sensitivity parameter, defined as the systematic
difference between the treated and untreated patients had they hypothetically been
exposed to the same treatment condition, provides a measure of the magnitude
of unmeasured confounding. Without external information, however, the sensitivity
parameter is not identified by observed data. In practice, therefore, this parameter is
often used to conduct a sensitivity analysis that assesses the robustness of the estimated
causal treatment effect to unmeasured confounding (Ko et al. 2003; Brumback et al. 2004).



In this paper, differential HAART prescription rates at the two types of study site
(study site used as an IV) provide an extra piece of information that allows us
to infer the magnitude of the sensitivity parameter. We impose a set
of subjective assumptions in the context of the HERS. These subjective assumptions
are encoded in terms of the potential outcomes as well as a sensitivity parameter for
unmeasured confounding. Putting together the sensitivity parameter, the IPW
estimating equations that account for measured confounding, and the IV estimating equation
leads to a system of estimating equations, unified under a constraint imposed by
principal stratification (Frangakis and Rubin 2002). By solving the unified equations,
we achieve our two objectives: (1) to estimate the population average treatment effect of
HAART at the initial treatment stage and (2) to quantify the magnitude of unmeasured
confounding.

The rest of the paper is organized as follows. More details about the HERS are
provided in Section 2. Notation and models are given in Section 3. In Section 4
we review the IV method and the IPW method, and introduce a unified system of
estimating equations based on them. In Section 5 we present a set of contextually
plausible constraints and assumptions and develop bounds on the average treatment
effect of HAART on CD4 count and on the degree of unmeasured confounding. In
Section 6 we analyze the HERS data, and finally in Section 7 we offer some points
for discussion.

2 Motivating Example: HERS 

The HERS was conducted from 1993 to 2001 to investigate the natural history of HIV
progression in women. Details of the HERS have been reported in Smith et al. (1997).
In the study, a total of 871 HIV-infected women were enrolled at four study sites:
Baltimore, New York City, Detroit, and Providence. The first two sites were community
clinics while the other two were academic medical centers. Clinical outcomes such as
CD4 count were recorded about every six months after enrollment. Starting in
1996, HAART became the recommended treatment regimen for HIV-infected people,
especially for those with low CD4 counts (Carpenter et al. 2000). Our analyses use data
extracted from 201 women at their 8th visit, who 6 months previously were
HAART-naive and had low CD4 counts (< 350 cells/mm^3). Using the HERS data, we want to
estimate the initial-stage causal effect of HAART on CD4 count in this population.
The study collected a rich set of covariates, but unmeasured confounding might
still exist.

The characteristics of the 201 women are summarized in Table 1. Among them,
46 (23%) had initiated HAART. Those receiving HAART have a higher CD4
count on average than those not on HAART, but this "as-received" treatment effect
(Ten Have et al. 2008) is not statistically significant (standard normal z statistic =
0.58). Ko et al. (2003) analyzed the data from the HERS and identified several
candidate confounders, which we list in the upper panel of Table 1. In brief, patients
receiving HAART are more likely to be aware of their HIV status and to have been on
antiretroviral therapy (ART) at enrollment and at the previous visit; less likely to show
any HIV symptom or to be a drug user; have higher viral loads (HIV RNA) at enrollment
and at the previous visit; and include relatively more white and fewer black patients.

In this paper, the type of study site is used as an IV, assuming that the conventional
IV assumptions (outlined in Section 3.1) are satisfied. The validity of these
assumptions is discussed in Section 7. In the lower panel of Table 1 we summarize
the patients' CD4 counts and HAART receipt rates stratified by the type of study site.
Notably, patients at academic centers are more likely to be prescribed HAART (28%
versus 18%) than those at the community clinics, and their average CD4 count is
slightly higher. Under the exogeneity assumption, this difference in CD4 count is the
causal effect of study site. Further, under the exclusion restriction, it is the causal effect
of the differential HAART assignment between the two types of study site. In the
following, we explore using this extra piece of information in conjunction with
other assumptions to infer the causal effect of HAART and the degree of unmeasured
confounding.
Table 1: Summary of patient demographic characteristics by HAART receipt status
and study site. The numbers inside parentheses are standard errors, z stands for
a standard normal test for comparing two sample means, and $\chi^2$ for a chi-squared
statistic for Pearson's chi-squared test.

                                     Received HAART?               Comparison
                                     Yes            No             statistic
  Number of patients, n              46             155
  Average CD4 count, cells/mm^3      229 (19)       216 (11)       z = 0.58
  Candidate confounders
    Race: black; white; others       46; 28; 26%    61; 15; 24%    $\chi^2$ = 5.1
    ART receipt rate
      at enrollment, %               50% (7.4)      39% (3.9)      z = 1.2
      at previous visit, %           74% (6.5)      57% (4.0)      z = 1.9
    Presence of HIV symptom, %       26% (6.5)      37% (3.9)      z = 1.2
    HIV RNA, log10 copy/mm^3
      at enrollment, average         3.2 (.15)      3.1 (.07)      z = .78
      at previous visit, average     3.7 (.15)      3.4 (.09)      z = 1.5
    Intravenous drug use
      recent, %                      22% (6.1)      25% (3.5)      z = .19
      lifetime, %                    61% (7.2)      63% (3.9)      z = .04
    Aware of HIV status, %           83% (5.6)      81% (3.2)      z = .08

                                     The HERS study site           Comparison
                                     Academic       Community      statistic
                                     centers        clinics
  Number of patients, n              93             108
  HAART received, n; %               26; 28%        20; 18%        z = 1.4
  Average CD4 count, cells/mm^3      230 (14)       210 (12)       z = 1.0
3 Notations and Definitions



3.1 Notations 



We use $Z$ to denote the IV (in the HERS, $Z = 1$ if the study site is an academic medical
center and $Z = 0$ otherwise), and $A_z$ the potential treatment status that an individual
would receive should $Z$ be set to $z$. Hence, each individual has a pair of potential
treatments $(A_1, A_0)$ that she would potentially receive at the two types of study site.
The actual treatment received is $A = A_Z = A_1 Z + A_0 (1 - Z)$, where $A = 1$ means that
the individual receives HAART, and $A = 0$ otherwise. With the Stable Unit Treatment
Value Assumption (Rubin 1974), we use $Y_z(a)$ to denote the potential outcome for an
individual should we hypothetically set the IV to $z$ and her treatment to $a$. Thus, the
actual outcome observed is $Y = Y_Z(A) = Y_Z(A_Z)$. Further, we denote all confounders
by a vector $X$, and the measured confounders by $V$, a subvector of $X$. The observed
data consist of $n$ independent and identically distributed copies of $\{X_i, Z_i, A_i, Y_i\}$,
$i = 1, 2, \ldots, n$.

We assume that the conventional IV assumptions - the exogeneity, exclusion
restriction, and monotonicity assumptions (Angrist et al. 1996; Imbens and Angrist 1994) -
are satisfied. The exclusion restriction assumes $Y(a) = Y_1(a) = Y_0(a)$, i.e. the IV
has no direct effect on the outcome beyond its impact on an individual's treatment. The
monotonicity assumption requires $\Pr(A_1 \geq A_0) = 1$, i.e. an individual who would not
receive HAART at an academic medical center would not do so at a community clinic
either. The exogeneity assumes that $Z$ is jointly independent of the potential outcomes
and treatments, $Z \perp (A_0, A_1, Y(0), Y(1))$.

3.2 Definitions of Causal Treatment Effect 

The causal effect of HAART treatment can be defined at different levels. The average
treatment effect (ATE), $E\{Y(1) - Y(0)\}$, is defined over the entire population. The
ATE is of broad interest in public health and epidemiology, and is the parameter
of interest in this paper. With an IV, the local average treatment effect (LATE;
Imbens and Angrist 1994), $E\{Y(1) - Y(0) \mid A_0 = 0, A_1 = 1\}$, is defined as the average
treatment effect among a subpopulation who would receive the treatment only when
$Z = 1$. The LATE can be estimated given a valid IV, regardless of the presence
of unmeasured confounding. However, the facts that this subpopulation is not fully
identifiable and that the interpretation of the LATE depends on the choice of IV pose
a significant limitation for generalizing results to a broader population and to other
settings.

The relationship between the ATE and LATE can be expressed using
principal stratification (Frangakis and Rubin 2002). For a binary instrument and a binary
treatment, principal stratification suggests that the population can be partitioned
into four mutually exclusive subpopulations based on the potential treatments each
individual would have. In our case, the potential treatments have the following four
possible combinations, $(A_0, A_1) \in \{(0, 0), (0, 1), (1, 0), (1, 1)\}$, where $\{(A_0, A_1) = (0, 0)\}$
indicates the subpopulation who would never receive HAART; $\{(A_0, A_1) = (0, 1)\}$
is the subpopulation who would receive HAART only at academic medical centers; and
so forth. The monotonicity assumption, $\Pr(A_1 \geq A_0) = 1$, implies that the
subpopulation $\{(A_0, A_1) = (1, 0)\}$ is an empty set. Henceforth, we denote the remaining three
subpopulations by $\mathcal{P}_{jk} = \{A_0 = j, A_1 = k\}$, $j \leq k$.

We denote the ATE and LATE by $\beta^{ATE}$ and $\beta^{LATE}$, respectively. Then their
relationship can be expressed by

    $\beta^{ATE} = \pi_{01} \beta^{LATE} + \pi_{00} \{\mu_{00}(1) - \mu_{00}(0)\} + \pi_{11} \{\mu_{11}(1) - \mu_{11}(0)\}$,    (1)

where $\pi_{jk} = \Pr(\mathcal{P}_{jk})$ and $\mu_{jk}(a) = E\{Y(a) \mid \mathcal{P}_{jk}\}$. In this paper, this relationship is
used to unify the IPW and IV estimation methods, and based on that, a system of
estimating equations is developed to draw inference on the ATE of HAART as well as
the magnitude of unmeasured confounding.
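
To make the decomposition in (1) concrete, the following is a minimal Python sketch that mixes the three stratum-specific effects by their stratum proportions. The function name, argument names, and numeric inputs are hypothetical and chosen only for illustration; they are not estimates from the HERS.

```python
def ate_from_strata(pi01, late, pi00, mu00_1, mu00_0, pi11, mu11_1, mu11_0):
    """Relationship (1): ATE as a mixture of stratum-specific effects.

    pi_jk are the stratum proportions and mu_jk_a the stratum means of Y(a).
    """
    if abs(pi00 + pi01 + pi11 - 1.0) > 1e-9:
        raise ValueError("stratum proportions must sum to 1")
    return pi01 * late + pi00 * (mu00_1 - mu00_0) + pi11 * (mu11_1 - mu11_0)


# Hypothetical values for illustration only (not HERS estimates).
example_ate = ate_from_strata(
    pi01=0.10, late=100.0,
    pi00=0.70, mu00_1=250.0, mu00_0=220.0,
    pi11=0.20, mu11_1=240.0, mu11_0=200.0,
)
print(example_ate)
```

In practice two of these inputs, the mean of Y(0) among those who would always be treated and the mean of Y(1) among those who would never be treated, are not observed, which is why the paper works with bounds rather than a point value.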

4 Review of Estimation Methods 

We review the IPW and IV methods in this section.

4.1 The IPW Method

Putting aside the covariates for the moment, the potential outcomes can be expressed
by a marginal structural mean model (Gange et al. 2007; Robins 1999; Robins et al.
2000),

    $E[Y(a)] = \beta_0 + \beta^{ATE} a$.    (2)

Assuming that $Y(a) \perp A \mid V$, i.e. unmeasured confounding is absent, we can estimate
$\beta^{ATE}$ by the solution $\hat{\beta}_{IPW}$ to the IPW estimating equations

    $U_1(\beta_{IPW}, \gamma) := \sum_{i=1}^{n} (1, A_i)^T W_{1i} (Y_i - \beta_1 - A_i \beta_{IPW}) = 0$,

where $W_{1i} = A_i / e(V_i; \gamma) + (1 - A_i) / \{1 - e(V_i; \gamma)\}$, and $e(V; \gamma) = \Pr(A = 1 \mid V)$ is the
propensity score (Rosenbaum and Rubin 1983) with a finite-dimensional parameter $\gamma$. We
assume that $0 < e(V; \gamma) < 1$.
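
As an illustration of the IPW estimating equations, the sketch below solves them for a binary treatment when the propensity scores are taken as given. With the intercept and treatment indicator as the only regressors, the solution reduces to the difference between the two weighted group means. The function name and the toy data are hypothetical.

```python
def ipw_ate(y, a, e):
    """Solve sum_i (1, A_i)^T W_i (Y_i - b1 - A_i * b_ipw) = 0 for b_ipw.

    W_i = A_i / e_i + (1 - A_i) / (1 - e_i). For a binary A the solution is
    the weighted mean of Y among the treated minus that among the untreated.
    """
    if not all(0.0 < ei < 1.0 for ei in e):
        raise ValueError("propensity scores must lie strictly between 0 and 1")
    w = [ai / ei + (1 - ai) / (1 - ei) for ai, ei in zip(a, e)]
    sw1 = sum(wi for wi, ai in zip(w, a) if ai == 1)
    sw0 = sum(wi for wi, ai in zip(w, a) if ai == 0)
    m1 = sum(wi * yi for wi, yi, ai in zip(w, y, a) if ai == 1) / sw1
    m0 = sum(wi * yi for wi, yi, ai in zip(w, y, a) if ai == 0) / sw0
    return m1 - m0


# Toy data (hypothetical): constant propensity score of 0.5.
y_toy = [10.0, 12.0, 7.0, 9.0]
a_toy = [1, 1, 0, 0]
e_toy = [0.5, 0.5, 0.5, 0.5]
print(ipw_ate(y_toy, a_toy, e_toy))
```

In a real analysis the propensity scores would themselves be estimated, for example by a logistic regression of treatment on the measured confounders.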

The IPW method has several properties that are worth mentioning. The efficiency
of the resulting estimator can be improved by using stabilized weights in place of $W_{1i}$
(Miguel et al. 2001). The estimator can be augmented to achieve double robustness if
we further specify an outcome regression model for $Y$ (Bang and Robins 2005).
Moreover, if $\gamma$ is unknown, $\hat{\beta}_{IPW}$ remains consistent when $\gamma$ is replaced by a consistent
estimator $\hat{\gamma}$ that solves

    $U_2(\gamma) := \sum_{i=1}^{n} W_{2i} \{A_i - e(V_i; \gamma)\} = 0$,

where $W_{2i}$ is an appropriate weight function; e.g. $W_{2i} = \partial e(V_i; \gamma) / \partial \gamma$ in logistic
regressions.
When unmeasured confounding exists, $U_1(\beta_{IPW})$ is biased, i.e. $E\{U_1(\beta_{IPW})\} \neq 0$.
In this case, Robins (1999) proposed to introduce a sensitivity parameter $\tau$ and then
estimate $\beta^{ATE}$ by the solution $\hat{\beta}_{MIPW}(\tau)$ to the following modified IPW estimating
equations

    $U_3(\beta_{MIPW}, \tau) := \sum_{i=1}^{n} (1, A_i)^T W_{1i} \{Y_i^* - \beta_2 - A_i \beta_{MIPW}\} = 0$,

where $Y_i^* = Y_i - \tau \{A_i - e(V_i; \gamma)\}$ is the "outcome" corrected for the selection bias due
to unmeasured confounding. For a binary treatment as in this paper, the sensitivity
parameter can be defined as the contrast of the potential outcomes between the treated
and untreated conditional on $V$,

    $\tau = (a - a') [E\{Y(a) \mid A = a, V\} - E\{Y(a) \mid A = a', V\}]$,

with $a = 1 - a'$. In the context of the HERS, $\tau > 0$ means that HAART is
preferentially given to those with higher CD4 counterfactuals $Y(a)$; $\tau < 0$ means the
opposite is true; and when $\tau = 0$, no unmeasured confounding is implied and the
resulting estimator is $\hat{\beta}_{MIPW}(0) = \hat{\beta}_{IPW}$.

Without additional information from data, the parameter $\tau$ is not identified. Hence,
the resulting estimator $\hat{\beta}_{MIPW}(\tau)$ is typically used to conduct a sensitivity analysis.
That is, estimate $\beta^{ATE}$ using $\hat{\beta}_{MIPW}(\tau)$ as if $\tau$ is known, and then examine the
sensitivity of $\hat{\beta}_{MIPW}(\tau)$ by varying the value of $\tau$ over its plausible range (Ko et al. 2003;
Brumback et al. 2004).
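
A sensitivity analysis of this kind can be sketched in a few lines of Python: for each candidate value of the sensitivity parameter, the outcome is corrected and the weighted difference in means is recomputed. This is a minimal sketch with the propensity scores taken as known; the function name, the grid of values, and the toy data are hypothetical.

```python
def mipw_ate(y, a, e, tau):
    """Modified IPW estimate for a fixed sensitivity parameter tau.

    The outcome is replaced by Y* = Y - tau * (A - e) and the usual IPW
    estimating equations are solved; tau = 0 reproduces the IPW estimate.
    """
    y_star = [yi - tau * (ai - ei) for yi, ai, ei in zip(y, a, e)]
    w = [ai / ei + (1 - ai) / (1 - ei) for ai, ei in zip(a, e)]
    sw1 = sum(wi for wi, ai in zip(w, a) if ai == 1)
    sw0 = sum(wi for wi, ai in zip(w, a) if ai == 0)
    m1 = sum(wi * yi for wi, yi, ai in zip(w, y_star, a) if ai == 1) / sw1
    m0 = sum(wi * yi for wi, yi, ai in zip(w, y_star, a) if ai == 0) / sw0
    return m1 - m0


# Toy data (hypothetical) and an arbitrary grid of tau values.
y_toy = [10.0, 12.0, 7.0, 9.0]
a_toy = [1, 1, 0, 0]
e_toy = [0.5, 0.5, 0.5, 0.5]
for tau in (-2.0, 0.0, 2.0):
    print(tau, mipw_ate(y_toy, a_toy, e_toy, tau))
```

The estimate moves in the opposite direction to the assumed value of the sensitivity parameter, which is the monotone relationship the later bounds rely on.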

With an IV and the information it provides, it becomes possible to draw inference
about the ATE as well as about $\tau$, which quantifies the degree of unmeasured confounding.

4.2 The IV Method 



IV methods have been widely used in econometric research (c.f. Wooldridge 2002).
In our just-identified case with a single binary IV and a binary treatment, the standard
IV estimating equations are

    $U_4(\beta_{IV}) := \sum_{i=1}^{n} (1, Z_i)^T (Y_i - \beta_3 - \beta_{IV} A_i) = 0$.

Under the IV assumptions and $Cov(Z, A) \neq 0$, the solution

    $\hat{\beta}_{IV} = \frac{\overline{YZ} / \bar{Z} - \overline{Y(1 - Z)} / \overline{(1 - Z)}}{\overline{AZ} / \bar{Z} - \overline{A(1 - Z)} / \overline{(1 - Z)}}$    (3)

is consistent for $\beta^{LATE}$ (Imbens and Angrist 1994; Angrist et al. 1996; Hernan and Robins
2006). The bars in (3) denote sample averages, e.g. $\overline{YZ} = \sum_i Y_i Z_i / n$. One important
property of the IV method is that $\hat{\beta}_{IV}$ remains consistent despite the existence
of unmeasured confounding.
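
The solution in (3) is a ratio of two differences in group means: the difference in average outcome between the two instrument levels, divided by the difference in treatment rates. A minimal Python sketch follows; the function name and the toy data are hypothetical.

```python
def wald_iv(y, a, z):
    """IV estimate (3) for a binary instrument Z and binary treatment A."""
    n1 = sum(1 for zi in z if zi == 1)
    n0 = len(z) - n1
    if n1 == 0 or n0 == 0:
        raise ValueError("both instrument levels must be observed")
    y1 = sum(yi for yi, zi in zip(y, z) if zi == 1) / n1   # mean of Y given Z = 1
    y0 = sum(yi for yi, zi in zip(y, z) if zi == 0) / n0   # mean of Y given Z = 0
    a1 = sum(ai for ai, zi in zip(a, z) if zi == 1) / n1   # treatment rate, Z = 1
    a0 = sum(ai for ai, zi in zip(a, z) if zi == 0) / n0   # treatment rate, Z = 0
    if a1 == a0:
        raise ValueError("instrument does not shift treatment (Cov(Z, A) = 0)")
    return (y1 - y0) / (a1 - a0)


# Toy data (hypothetical).
z_toy = [1, 1, 1, 1, 0, 0, 0, 0]
a_toy = [1, 1, 1, 0, 1, 0, 0, 0]
y_toy = [12.0, 14.0, 10.0, 8.0, 11.0, 7.0, 9.0, 5.0]
print(wald_iv(y_toy, a_toy, z_toy))
```

When the two treatment rates are close, the denominator is small and the estimate becomes unstable, which is consistent with the wide confidence interval reported for the LATE later in the paper.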
Under the framework of the generalized method of moments, the IV estimating
equations are solved using the two-stage least squares method (Angrist and Imbens
1995), and can further incorporate a weight matrix to allow for heteroskedastic or
correlated residuals. IV methods can also be generalized to deal with multiple IVs
and non-continuous outcomes (c.f. Wooldridge).



5 A Unified System of Estimating Equations 

With a set of covariates and an IV in the HERS, we propose to combine the IV and
IPW methods, and develop a unified system of estimating equations as follows,

    $(U_2(\gamma), U_3(\beta_{MIPW}, \tau), U_4(\beta_{IV}))^T = 0$,    (4)

with the constraint (1). Note that in (1), the parameters $\mu_{11}(0)$ and $\mu_{00}(1)$ are the averages
of unobserved potential outcomes and are not identified. All other parameters are identified
because $\pi_{11} = \Pr(A = 1 \mid Z = 0)$, $\pi_{00} = \Pr(A = 0 \mid Z = 1)$, $\pi_{01} = 1 - \pi_{00} - \pi_{11}$,
$\mu_{11}(1) = E(Y \mid A = 1, Z = 0)$, and $\mu_{00}(0) = E(Y \mid A = 0, Z = 1)$ (Angrist et al. 1996, and
Rejoinder). When natural limits exist on $\mu_{11}(0)$ and $\mu_{00}(1)$, e.g. with binary outcomes,
both the ATE and $\tau$ are partially identified to bounds. When no natural limits exist,
as in our case with continuous outcomes, additional prior information is needed to
implement the constrained system of estimating equations, which we discuss next.
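
The identified quantities listed above are all simple cell proportions or cell means of the observed data. The sketch below computes their sample analogues; it is a minimal illustration, and the function name, dictionary keys, and toy data are hypothetical.

```python
def identified_quantities(y, a, z):
    """Sample analogues of the identified terms in constraint (1)."""
    z0 = [i for i, zi in enumerate(z) if zi == 0]
    z1 = [i for i, zi in enumerate(z) if zi == 1]
    pi11 = sum(a[i] for i in z0) / len(z0)        # Pr(A = 1 | Z = 0)
    pi00 = sum(1 - a[i] for i in z1) / len(z1)    # Pr(A = 0 | Z = 1)
    pi01 = 1.0 - pi00 - pi11                      # treated only when Z = 1
    y_treated_z0 = [y[i] for i in z0 if a[i] == 1]
    y_untreated_z1 = [y[i] for i in z1 if a[i] == 0]
    return {
        "pi11": pi11,
        "pi00": pi00,
        "pi01": pi01,
        "mu11_1": sum(y_treated_z0) / len(y_treated_z0),      # E(Y | A = 1, Z = 0)
        "mu00_0": sum(y_untreated_z1) / len(y_untreated_z1),  # E(Y | A = 0, Z = 1)
    }


# Toy data (hypothetical).
z_toy = [1, 1, 1, 1, 0, 0, 0, 0]
a_toy = [1, 1, 1, 0, 1, 0, 0, 0]
y_toy = [12.0, 14.0, 10.0, 8.0, 11.0, 7.0, 9.0, 5.0]
q = identified_quantities(y_toy, a_toy, z_toy)
print(q)
```

The two remaining means, of Y(0) among those always treated and of Y(1) among those never treated, have no sample analogue, so they enter the analysis only through assumed limits.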

We present three sets of assumptions in the context of the HERS. Each allows us to
identify bounds on the ATE and the unmeasured confounding parameter $\tau$. In Sections 5.1
to 5.3, we assume that the sample size $n$ is sufficiently large that the sampling
variation of the estimating equations (4) can be ignored. In Sections 5.4 and 5.5 we discuss
inference on the sampling uncertainty of the bound estimates for a finite $n$.

5.1 Assumption on the Upper Limits of $\mu_{11}(0)$ and $\mu_{00}(1)$

The outcome variable of interest is CD4 count, so both $\mu_{00}(1)$ and $\mu_{11}(0)$ must be
greater than zero. In our first set of assumptions, we simply assume that
there exist two upper bounds such that

Assumption (A): $0 < \mu_{00}(1) < \xi_1$, $0 < \mu_{11}(0) < \xi_0$, with known $\xi_0$ and $\xi_1$.

Assumption (A) leads to a simplified version of the Robins-Manski type bound on
the ATE (Robins 1989; Manski 1990; Zhang and Rubin 2003). It is straightforward to
show that the ATE falls within the interval

    $[b(\xi_0, 0), b(0, \xi_1)]$,

where, to emphasize the unidentifiable parameters in (1), we define $b(\mu_{11}(0), \mu_{00}(1)) =
\pi_{01} \times LATE + \pi_{11} \{\mu_{11}(1) - \mu_{11}(0)\} + \pi_{00} \{\mu_{00}(1) - \mu_{00}(0)\}$. The lower endpoint
takes $\mu_{11}(0)$ at its upper limit and $\mu_{00}(1)$ at zero; the upper endpoint does the reverse.
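
The bound under Assumption (A) amounts to evaluating the function b at the two extreme configurations of the unidentified means. The Python sketch below shows this; the names `xi0` and `xi1` for the two upper limits, the function names, and all numeric inputs are hypothetical and are not HERS estimates.

```python
def b(mu11_0, mu00_1, pi01, late, pi11, mu11_1, pi00, mu00_0):
    """ATE as a function of the two unidentified means, as in constraint (1)."""
    return pi01 * late + pi11 * (mu11_1 - mu11_0) + pi00 * (mu00_1 - mu00_0)


def ate_bound_a(xi0, xi1, **identified):
    """Interval [b(xi0, 0), b(0, xi1)] implied by Assumption (A)."""
    lower = b(xi0, 0.0, **identified)   # Y(0) mean at its cap, Y(1) mean at zero
    upper = b(0.0, xi1, **identified)   # the reverse configuration
    return lower, upper


# Hypothetical identified quantities, for illustration only.
ident = dict(pi01=0.10, late=100.0, pi11=0.20, mu11_1=240.0, pi00=0.70, mu00_0=220.0)
bound_a = ate_bound_a(xi0=500.0, xi1=500.0, **ident)
print(bound_a)
```

With generous upper limits the interval is wide, which motivates the tighter assumptions considered next in the paper.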

Then the bound on $\tau$ can be inferred by finding the values of $\tau$ such that the
corresponding solutions $\hat{\beta}_{MIPW}(\tau)$ are consistent with the above bound on the ATE. For
a given $\beta^{ATE}$, the solution to $U_3(\beta^{ATE}, \tau) = 0$ for $\tau$ is

    $\tau_n(\beta^{ATE}, \gamma) = \frac{\overline{W_1 A} \cdot \overline{W_1 (Y - \beta^{ATE} A)} - \overline{W_1} \cdot \overline{W_1 A (Y - \beta^{ATE} A)}}{\overline{W_1 A} \cdot \overline{W_1 (A - e(V; \gamma))} - \overline{W_1} \cdot \overline{W_1 A (A - e(V; \gamma))}}$.

It is straightforward to verify that $\tau_n(\beta^{ATE}, \gamma)$ is a non-increasing function of $\beta^{ATE}$, so
the unmeasured confounding parameter $\tau$ is bounded by

    $[\tau(b(0, \xi_1), \gamma), \tau(b(\xi_0, 0), \gamma)]$,

where $\tau(\beta^{ATE}, \gamma) = \tau_\infty(\beta^{ATE}, \gamma)$.
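
The closed-form solution for the sensitivity parameter can be written directly in terms of sample averages. The following Python sketch evaluates it for a given value of the ATE with the propensity scores taken as known; the function names and the toy data are hypothetical.

```python
def _mean(xs):
    return sum(xs) / len(xs)


def tau_n(y, a, e, beta_ate):
    """Value of tau that solves the modified IPW equations at a given ATE."""
    w = [ai / ei + (1 - ai) / (1 - ei) for ai, ei in zip(a, e)]
    r = [yi - beta_ate * ai for yi, ai in zip(y, a)]   # Y - beta * A
    d = [ai - ei for ai, ei in zip(a, e)]              # A - e(V)
    wa_bar = _mean([wi * ai for wi, ai in zip(w, a)])
    w_bar = _mean(w)
    num = wa_bar * _mean([wi * ri for wi, ri in zip(w, r)]) \
        - w_bar * _mean([wi * ai * ri for wi, ai, ri in zip(w, a, r)])
    den = wa_bar * _mean([wi * di for wi, di in zip(w, d)]) \
        - w_bar * _mean([wi * ai * di for wi, ai, di in zip(w, a, d)])
    return num / den


# Toy data (hypothetical): constant propensity score of 0.5.
y_toy = [10.0, 12.0, 7.0, 9.0]
a_toy = [1, 1, 0, 0]
e_toy = [0.5, 0.5, 0.5, 0.5]
print(tau_n(y_toy, a_toy, e_toy, beta_ate=1.0))
```

Because the function is non-increasing in the ATE, evaluating it at the upper and lower ends of an ATE bound gives the lower and upper ends of the corresponding bound on the sensitivity parameter.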

5.2 Constraint on Relationships between $\mu_{11}(0)$ and $\mu_{00}(1)$ and
Identified Quantities

Assumption (A) alone is sufficient for identifying the bounds on the ATE and $\tau$, but in
practice the two upper limits need to be sufficiently large and the resulting bounds can
be wide. In the following, we consider making assumptions on the relative magnitude
of the unidentifiable and identifiable quantities.
Assumption (B): We assume that

1. The average treatment effect among $\mathcal{P}_{11}$ is no less than a known $\delta_{11}$,

    $E\{Y(1) - Y(0) \mid \mathcal{P}_{11}\} = \mu_{11}(1) - \mu_{11}(0) \geq \delta_{11}$.

A plausible choice for $\delta_{11}$ is zero; that is, we assume that those in $\mathcal{P}_{11}$, who
would always receive HAART, benefit from HAART on average. We make
this assumption because, although subject to confounding bias, the efficacy of
HAART in treated patients has been demonstrated by several contemporary
studies. Further, we impose a known lower bound on the average treatment effect
among $\mathcal{P}_{00}$,

    $E\{Y(1) - Y(0) \mid \mathcal{P}_{00}\} = \mu_{00}(1) - \mu_{00}(0) \geq \delta_{00}$.

A negative value of $\delta_{00}$ implies that HAART can be harmful for those who would
never receive HAART at either site. Setting $\delta_{00} = 0$ implies that HAART is also
beneficial for them on average.

2. The difference in $E\{Y(0)\}$ between $\mathcal{P}_{11}$ and $\mathcal{P}_{00}$ is bounded above,

    $E\{Y(0) \mid \mathcal{P}_{11}\} - E\{Y(0) \mid \mathcal{P}_{00}\} = \mu_{11}(0) - \mu_{00}(0) \leq \delta_{y0}$.

We can set $\delta_{y0} = 0$ by our intuition that, in the untreated condition, people who
would always receive HAART have a higher degree of HIV progression (lower CD4
count on average) than those who would never receive HAART.

3. The difference in treatment effects between those who would always receive
HAART and those who would never receive HAART is bounded below,

    $E\{Y(1) - Y(0) \mid \mathcal{P}_{11}\} - E\{Y(1) - Y(0) \mid \mathcal{P}_{00}\} = \{\mu_{11}(1) - \mu_{11}(0)\} - \{\mu_{00}(1) - \mu_{00}(0)\} \geq \delta_{trt}$.

For example, letting $\delta_{trt} = 0$ implies that the treatment effect on those who would
always receive HAART is at least as large as that on those who would never do so.
Under this set of assumptions, we show that the ATE is bounded by

    $[b(c_0, \mu_{00}(0) + \delta_{00}), b(0, c_1)]$

and $\tau$ by

    $[\tau(b(0, c_1), \gamma), \tau(b(c_0, \mu_{00}(0) + \delta_{00}), \gamma)]$,

where $c_0 = \min\{\mu_{11}(1) - \delta_{11}, \mu_{00}(0) + \delta_{y0}\}$ and $c_1 = \mu_{11}(1) + \mu_{00}(0) - \delta_{trt}$.

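
The bound on the ATE under Assumption (B) can be sketched in Python by computing the two constants and evaluating the function b at the corresponding arguments. This is a minimal sketch; the function names and all numeric inputs are hypothetical and are not HERS estimates.

```python
def b(mu11_0, mu00_1, pi01, late, pi11, mu11_1, pi00, mu00_0):
    """ATE as a function of the two unidentified means, as in constraint (1)."""
    return pi01 * late + pi11 * (mu11_1 - mu11_0) + pi00 * (mu00_1 - mu00_0)


def ate_bound_b(ident, d11=0.0, d00=0.0, dy0=0.0, dtrt=0.0):
    """ATE interval under Assumption (B) with constants d11, d00, dy0, dtrt."""
    c0 = min(ident["mu11_1"] - d11, ident["mu00_0"] + dy0)   # cap on mean Y(0), always treated
    c1 = ident["mu11_1"] + ident["mu00_0"] - dtrt            # cap on mean Y(1), never treated
    lower = b(c0, ident["mu00_0"] + d00, **ident)
    upper = b(0.0, c1, **ident)
    return lower, upper


# Hypothetical identified quantities, for illustration only.
ident = dict(pi01=0.10, late=100.0, pi11=0.20, mu11_1=240.0, pi00=0.70, mu00_0=220.0)
bound_b = ate_bound_b(ident)
print(bound_b)
```

With all four constants set to zero, as in the paper's data analysis, the lower end of the interval is driven entirely by the identified quantities rather than by an externally chosen cap.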
5.3 Constraint Conditional on Measured Covariates 

For the HERS, it may be more realistic to assume that Assumption (B) holds
conditional on clinically important covariates $V$. So we propose our third set of assumptions
as

Assumption (B'):

1. $E\{Y(1) - Y(0) \mid \mathcal{P}_{11}, V\} \geq \delta_{11}$; $E\{Y(1) - Y(0) \mid \mathcal{P}_{00}, V\} \geq \delta_{00}$.

2. $E\{Y(0) \mid \mathcal{P}_{11}, V\} - E\{Y(0) \mid \mathcal{P}_{00}, V\} \leq \delta_{y0}$.

3. $E\{Y(1) - Y(0) \mid \mathcal{P}_{11}, V\} - E\{Y(1) - Y(0) \mid \mathcal{P}_{00}, V\} \geq \delta_{trt}$, for known $\delta_{11}$, $\delta_{00}$, $\delta_{y0}$
and $\delta_{trt}$.

4. Further, we assume that the monotonicity and exclusion restriction assumptions
hold conditional on $V$, and the constraint (1) becomes

    $\beta^{ATE} = \pi_{01} \times \beta^{LATE} + \int_{\mathcal{V}} E\{Y(1) - Y(0) \mid \mathcal{P}_{11}, V\} \Pr(\mathcal{P}_{11} \mid V) dF(V)$
    $\qquad + \int_{\mathcal{V}} E\{Y(1) - Y(0) \mid \mathcal{P}_{00}, V\} \Pr(\mathcal{P}_{00} \mid V) dF(V)$,    (5)

where $\mathcal{V}$ is the support of $V$ with a distribution $F(V)$. We write the product
$\pi_{01} \times \beta^{LATE}$ as before because both the LATE and $\pi_{01}$ are identified by the data.
Again, no observed data are available for $E\{Y(0) \mid \mathcal{P}_{11}, V\}$ and $E\{Y(1) \mid \mathcal{P}_{00}, V\}$,
which are denoted by $\mu_{11}(0, V)$ and $\mu_{00}(1, V)$, respectively.

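Under (B') and (5), we obtain a bound on the ATE

    $[\pi_{01} \times LATE + \int_{\mathcal{V}} b_V(c_0(V), c_2(V)) dF, \; \pi_{01} \times LATE + \int_{\mathcal{V}} b_V(0, c_1(V)) dF]$

and a bound on $\tau$

    $[\tau(\pi_{01} \times LATE + \int_{\mathcal{V}} b_V(0, c_1(V)) dF, \gamma), \; \tau(\pi_{01} \times LATE + \int_{\mathcal{V}} b_V(c_0(V), c_2(V)) dF, \gamma)]$,

where $b_V(\mu_{11}(0, V), \mu_{00}(1, V)) = [E\{Y(1) \mid \mathcal{P}_{11}, V\} - \mu_{11}(0, V)] \Pr(\mathcal{P}_{11} \mid V) + [\mu_{00}(1, V) -
E\{Y(0) \mid \mathcal{P}_{00}, V\}] \Pr(\mathcal{P}_{00} \mid V)$, $c_0(V) = \min(E\{Y(1) \mid \mathcal{P}_{11}, V\} - \delta_{11}, E\{Y(0) \mid \mathcal{P}_{00}, V\} + \delta_{y0})$,
$c_1(V) = E\{Y(1) \mid \mathcal{P}_{11}, V\} + E\{Y(0) \mid \mathcal{P}_{00}, V\} - \delta_{trt}$, and $c_2(V) = E\{Y(0) \mid \mathcal{P}_{00}, V\} + \delta_{00}$.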
5.4 Inference from Finite Samples

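To implement (1), we can assume two regression models $E(A \mid Z) = \mathrm{logit}^{-1}(\eta(Z; \theta_1))$
and $E(Y \mid A, Z) = \kappa(A, Z; \theta_2)$ for some known functions $\eta(Z; \theta_1)$ and $\kappa(A, Z; \theta_2)$. For
binary $Z$ and $A$, we use two saturated models and specify that $\eta(Z; \theta_1) = \theta_{10} + \theta_{11} Z$ and
$\kappa(A, Z; \theta_2) = \theta_{20} + \theta_{21} Z + \theta_{22} A + \theta_{23} AZ$ with $\theta_1 = (\theta_{10}, \theta_{11})^T$ and $\theta_2 = (\theta_{20}, \theta_{21}, \theta_{22}, \theta_{23})^T$.
Here, a saturated additive model for $\kappa(A, Z; \theta_2)$ is compatible with the structural model
(2), since one can show that (2) suggests that $E(Y \mid A, Z)$ is linear in $A$, $Z$, and $AZ$.

Given the two regression models, we can obtain consistent estimators $\hat{\theta}_1$ and $\hat{\theta}_2$ by
solving

    $\sum_{i=1}^{n} W_{3i} [A_i - \mathrm{logit}^{-1}\{\eta(Z_i; \theta_1)\}] = 0$,
    $\sum_{i=1}^{n} W_{4i} \{Y_i - \kappa(A_i, Z_i; \theta_2)\} = 0$,

where $W_{3i} = \partial \mathrm{logit}^{-1}\{\eta(Z_i; \theta_1)\} / \partial \theta_1$ and $W_{4i} = (1, Z_i, A_i, A_i Z_i)^T$. Then, we
estimate $\hat{\pi}_{11} = \mathrm{logit}^{-1}\{\eta(0; \hat{\theta}_1)\}$, $\hat{\pi}_{00} = 1 - \mathrm{logit}^{-1}\{\eta(1; \hat{\theta}_1)\}$, $\hat{\pi}_{01} = \mathrm{logit}^{-1}\{\eta(1; \hat{\theta}_1)\} -
\mathrm{logit}^{-1}\{\eta(0; \hat{\theta}_1)\}$, $\hat{\mu}_{11}(1) = \kappa(1, 0; \hat{\theta}_2)$, and $\hat{\mu}_{00}(0) = \kappa(0, 1; \hat{\theta}_2)$.

For Assumptions (A) and (B), the function $b()$ can be estimated by replacing
the identifiable quantities with their consistent estimators, i.e. $\hat{b}(\mu_{11}(0), \mu_{00}(1)) =
\hat{\pi}_{01} \hat{\beta}_{IV} + \hat{\pi}_{11} \{\hat{\mu}_{11}(1) - \mu_{11}(0)\} + \hat{\pi}_{00} \{\mu_{00}(1) - \hat{\mu}_{00}(0)\}$. Further, we estimate $c_0$ by
$\hat{c}_0(\delta_{11}, \delta_{y0}) = \min\{(\hat{\mu}_{11}(1) - \delta_{11}), (\hat{\mu}_{00}(0) + \delta_{y0})\}$ and $c_1$ by $\hat{c}_1(\delta_{trt}) = \hat{\mu}_{11}(1) + \hat{\mu}_{00}(0) - \delta_{trt}$.
By substituting these estimates for their estimands in the bounds, we obtain bound
estimates on the ATE and $\tau$. The results are summarized in Table 2.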

To estimate the bounds on the ATE and $\tau$ under Assumption (B'), we proceed as
follows:

Step 1. We assume two observed-data models conditional on $V$, $E(A \mid Z, V) = \mathrm{logit}^{-1}(\eta(Z, V; \theta_3))$
and $E(Y \mid Z, A, V) = \kappa(Z, A, V; \theta_4)$ for known $\eta(Z, V; \theta_3)$ and $\kappa(Z, A, V; \theta_4)$. For
example, we can assume two linear models without interaction, $\eta(Z, V; \theta_3) =
\theta_{30} + \theta_{31} Z + \theta_{32}^T V$ with $\theta_3 = (\theta_{30}, \theta_{31}, \theta_{32}^T)^T$ and $\kappa(Z, A, V; \theta_4) = \theta_{40} + \theta_{41} Z +
\theta_{42} A + \theta_{43}^T V$ with $\theta_4 = (\theta_{40}, \theta_{41}, \theta_{42}, \theta_{43}^T)^T$. Let $\hat{\theta}_4$ and $\hat{\theta}_3$ denote the estimators of
$\theta_4$ and $\theta_3$ obtained by solving the corresponding estimating equations.

Step 2. With the monotonicity assumption and exclusion restriction conditional on $V$,
we estimate $\hat{E}\{Y(0) \mid \mathcal{P}_{00}, V\} = \hat{E}(Y \mid Z = 1, A = 0, V) = \kappa(1, 0, V; \hat{\theta}_4)$,
$\hat{E}\{Y(1) \mid \mathcal{P}_{11}, V\} = \kappa(0, 1, V; \hat{\theta}_4)$, $\hat{\Pr}(\mathcal{P}_{11} \mid V) = \pi(0, V; \hat{\theta}_3)$, and $\hat{\Pr}(\mathcal{P}_{00} \mid V) = 1 -
\pi(1, V; \hat{\theta}_3)$, where $\pi(z, V; \theta_3) = \mathrm{logit}^{-1}\{\eta(z, V; \theta_3)\}$. Then we estimate the functions
$b_V()$, $c_0()$, $c_1()$, and $c_2()$ by plugging in the above estimators.

Step 3. We estimate the distribution $F(V)$ by the empirical cumulative distribution function
of $V$ and integrals by empirical sums, e.g. estimate $\int_{\mathcal{V}} b_V(c_0(V), c_2(V)) dF$ by
$\sum_{i=1}^{n} \hat{b}_V(\hat{c}_0(V_i), \hat{c}_2(V_i)) / n$. Then, we substitute the parameters in (5) by their
estimates.

The resulting bound estimates on the ATE and $\tau$ are summarized in Table 2.
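
Steps 2 and 3 can be sketched in Python once fitted versions of the two regression functions are available. In the sketch below they are passed in as plain callables, so the model-fitting step is outside its scope; `kappa_hat`, `prob_hat`, the function name, and the toy covariate values are all hypothetical placeholders rather than fitted HERS models.

```python
def ate_bound_b_prime(v_samples, kappa_hat, prob_hat, pi01_late,
                      d11=0.0, d00=0.0, dy0=0.0, dtrt=0.0):
    """Empirical-average version of the ATE bound under Assumption (B').

    kappa_hat(z, a, v): fitted E(Y | Z = z, A = a, V = v).
    prob_hat(z, v): fitted Pr(A = 1 | Z = z, V = v).
    pi01_late: the identified product of the middle stratum share and the LATE.
    """
    lower_sum = 0.0
    upper_sum = 0.0
    for v in v_samples:
        m11_1 = kappa_hat(0, 1, v)       # mean Y(1), always treated, given v
        m00_0 = kappa_hat(1, 0, v)       # mean Y(0), never treated, given v
        p11 = prob_hat(0, v)
        p00 = 1.0 - prob_hat(1, v)
        c0 = min(m11_1 - d11, m00_0 + dy0)
        c1 = m11_1 + m00_0 - dtrt
        c2 = m00_0 + d00
        # b_V evaluated at (c0, c2) for the lower end and (0, c1) for the upper end
        lower_sum += (m11_1 - c0) * p11 + (c2 - m00_0) * p00
        upper_sum += (m11_1 - 0.0) * p11 + (c1 - m00_0) * p00
    n = len(v_samples)
    return pi01_late + lower_sum / n, pi01_late + upper_sum / n


# Hypothetical fitted functions and covariate values, for illustration only.
kappa_toy = lambda z, a, v: 200.0 + 10.0 * z + 30.0 * a + 5.0 * v
prob_toy = lambda z, v: 0.2 + 0.1 * z
bound_bp = ate_bound_b_prime([0.0, 2.0, 4.0], kappa_toy, prob_toy, pi01_late=10.0)
print(bound_bp)
```

Averaging over the observed covariate values is the empirical-sum replacement for the integrals described in Step 3.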
5.5 Uncertainty Region for Estimated Bounds

An interval that provides $(1 - \theta)100\%$ coverage probability on a bound estimate is
often called the $(1 - \theta)100\%$ Uncertainty Region (UR) to distinguish it from a
confidence interval. A UR takes into account both the sampling variability and partial
identifiability. Two types of URs are considered in this paper: point-wise and strong
$(1 - \theta)100\%$ coverage URs.

A point-wise UR contains any particular value $g \in (l, u)$ with a probability
of at least $(1 - \theta)$, where $(l, u)$ denotes the true bound and $g$ is the parameter
generating the data. If the estimates $L$ and $U$ are consistent and asymptotically normally
distributed (CAN), a $(1 - \theta)100\%$ point-wise UR is given by

$$\mathrm{UR}_{p\text{-}can} = [L - c^{*}\,\mathrm{se}(L),\; U + c^{*}\,\mathrm{se}(U)],$$

where $\mathrm{se}(\cdot)$ is the standard error and $c^{*}$ is a critical value. When $U - L$ is large
compared to $\mathrm{se}(L)$ and $\mathrm{se}(U)$, $c^{*}$ can be approximated by $\Phi^{-1}(1 - \theta)$, where $\Phi$ is the
standard normal cumulative distribution function (Vansteelandt et al., 2006).

A strong UR is defined as an interval that contains the entire set $(l, u)$ with a
probability of at least $(1 - \theta)$ (Horowitz and Manski, 2000; Vansteelandt et al., 2006).
If both $L$ and $U$ are CAN, a strong $(1 - \theta)100\%$ UR is

$$\mathrm{UR}_{s\text{-}can} = [L - c\,\mathrm{se}(L),\; U + c\,\mathrm{se}(U)],$$

with $c = \Phi^{-1}(1 - \theta/2)$, which equals 1.96 for $\theta = 0.05$. Without assuming $L$ and $U$
to be CAN, a strong $(1 - \theta)100\%$ UR can be obtained using the bootstrap method. Specifically,
let $(L^{*}, U^{*})$ denote the estimated bound from a bootstrapped sample. A bootstrap strong
$(1 - \theta)100\%$ UR is the interval $(L^{*}_{l}, U^{*}_{u})$ satisfying
$\Pr^{*}(L^{*}_{l} < L^{*},\, U^{*} < U^{*}_{u}) = 1 - \theta$ and $\Pr^{*}(L^{*} < L^{*}_{l}) = \Pr^{*}(U^{*} > U^{*}_{u})$,
where $\Pr^{*}$ is the probability measure induced by the bootstrapped resamples
(Bickel and Freedman, 1981). In practice, it can be obtained by finding the shortest
interval $\mathrm{UR}_{s\text{-}bts} = (L^{*}_{l}, U^{*}_{u})$ that satisfies
$\#(L^{*}_{l} < L^{*}_{k},\, U^{*}_{k} < U^{*}_{u})/K \ge 1 - \theta$ and
$\#(L^{*}_{k} < L^{*}_{l})/K \approx \#(U^{*}_{k} > U^{*}_{u})/K$,
where $\#(\cdot)$ counts the number of statements that hold for $k = 1, 2, \ldots, K$.
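
The three uncertainty regions of this section can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used for the analysis: the function names `can_urs` and `bootstrap_strong_ur` are hypothetical, the standard errors in the usage line are illustrative values chosen to be roughly in line with the (B) row of Table 3, and the bootstrap search trims the same number of draws from each tail as one simple way to meet the equal-tail condition.

```python
from statistics import NormalDist


def can_urs(L, U, se_L, se_U, theta=0.05):
    """Point-wise and strong URs for a bound estimate (L, U) assumed to be CAN."""
    z = NormalDist()
    # Point-wise critical value: approximation valid when U - L is large
    # relative to the standard errors.
    c_point = z.inv_cdf(1 - theta)
    # Strong critical value: covers the whole interval, so both ends use theta/2.
    c_strong = z.inv_cdf(1 - theta / 2)
    pointwise = (L - c_point * se_L, U + c_point * se_U)
    strong = (L - c_strong * se_L, U + c_strong * se_U)
    return pointwise, strong


def bootstrap_strong_ur(L_star, U_star, theta=0.05):
    """Shortest equal-tailed interval whose joint bootstrap coverage is >= 1 - theta.

    L_star[k], U_star[k] are the bound estimates from the k-th bootstrap sample.
    """
    K = len(L_star)
    if K == 0:
        raise ValueError("need at least one bootstrap draw")
    Ls, Us = sorted(L_star), sorted(U_star)
    # m = number of draws trimmed from each tail; a larger m gives a shorter interval,
    # so we start from the largest candidate and relax until coverage is reached.
    for m in range(int(theta * K), -1, -1):
        lo, hi = Ls[m], Us[K - 1 - m]
        covered = sum(1 for a, b in zip(L_star, U_star) if a >= lo and b <= hi)
        if covered / K >= 1 - theta:
            return lo, hi  # m = 0 always qualifies, so the loop always returns


# Illustrative call: bound estimate (20, 231) with assumed standard errors.
pointwise_ur, strong_ur = can_urs(20, 231, se_L=17.8, se_U=29.8)
```

Because the lower and upper bound estimates are typically correlated across bootstrap samples, the joint coverage is checked directly on the paired draws rather than derived from the two marginal quantiles.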

6 HERS Data: Treatment Effect Estimation and 
Confounding Assessment 

6.1 Preliminary Analyses 

The upper panel of Table 3 summarizes the IPW estimate of the ATE and the IV estimate
of the LATE of HAART on CD4 count. The IPW estimator uses the variables listed in Table 1
as the measured confounders $V$ and assumes that $e(V; \gamma) = \mathrm{logit}^{-1}(\gamma^{T} V)$. The
IPW estimate of the ATE suggests that HAART can raise a patient's CD4 count by 27
cells/mm$^3$ on average, with a 95% confidence interval of $(-16, 70)$. The IV estimate
of the LATE suggests that, for those who would receive HAART at academic medical
centers but not at community clinics, HAART can increase CD4 count by 207 cells/mm$^3$ on
average, with a 95% CI of $(-250, 664)$. In Table 3 we also list the "as-treated" (AT)
treatment effect, which is estimated by the contrast of the average CD4 counts between
those actually receiving HAART and those not. The difference between the IPW and
AT estimates can be regarded as the bias of the AT estimate that is attributable to the
measured confounders.
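
As a rough illustration of how the three point estimates in the upper panel of Table 3 differ, the sketch below computes an as-treated contrast, an IPW estimate, and an IV estimate on toy data. It rests on assumptions not stated in the text: the IPW estimator is written with normalized weights, the IV estimator is the simple Wald ratio for a binary instrument without covariate adjustment, and the function names and toy data are hypothetical. The propensity scores `e` are taken as given (for example, fitted values from a logistic regression).

```python
def _mean(xs):
    xs = list(xs)
    return sum(xs) / len(xs)


def as_treated(y, a):
    """Unadjusted contrast of mean outcomes: treated minus untreated."""
    return (_mean(yi for yi, ai in zip(y, a) if ai == 1)
            - _mean(yi for yi, ai in zip(y, a) if ai == 0))


def ipw_ate(y, a, e):
    """IPW estimate of the ATE with normalized weights.

    e[i] is the estimated propensity score Pr(A = 1 | V_i), strictly in (0, 1).
    """
    w1 = [ai / ei for ai, ei in zip(a, e)]
    w0 = [(1 - ai) / (1 - ei) for ai, ei in zip(a, e)]
    mu1 = sum(w * yi for w, yi in zip(w1, y)) / sum(w1)
    mu0 = sum(w * yi for w, yi in zip(w0, y)) / sum(w0)
    return mu1 - mu0


def wald_late(y, a, z):
    """Wald (ratio) IV estimate of the LATE for a binary instrument z."""
    num = (_mean(yi for yi, zi in zip(y, z) if zi == 1)
           - _mean(yi for yi, zi in zip(y, z) if zi == 0))
    den = (_mean(ai for ai, zi in zip(a, z) if zi == 1)
           - _mean(ai for ai, zi in zip(a, z) if zi == 0))
    return num / den


# Toy data (hypothetical): z = site type, a = treatment received, y = outcome.
z = [0, 0, 0, 0, 1, 1, 1, 1]
a = [0, 0, 0, 1, 0, 1, 1, 1]
y = [10, 10, 10, 30, 10, 30, 30, 30]
e = [0.5] * 8  # constant propensity score, for illustration only

at_est = as_treated(y, a)
ipw_est = ipw_ate(y, a, e)
iv_est = wald_late(y, a, z)
```

In this toy example there is no confounding, so the three estimators agree; in the HERS data they target different quantities (ATE versus LATE) and are affected differently by measured and unmeasured confounding.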

6.2 Bounds on HAART Treatment Effect and Unmeasured 
Confounding 

For Assumption (A), we let the upper limits $\xi_0 = \xi_1 = 500$. (Recall that the two limits
are on the expected values of $Y(0)$ among $\mathcal{P}_{11}$ and $Y(1)$ among $\mathcal{P}_{00}$.) We choose the
two limits based on the facts that the average CD4 count at the previous visit was
much lower than 350 and that, at the eighth visit, the average CD4 count was 229 for those
treated and 216 for those untreated (see Table 1). For Assumptions (B) and (B'),
we let $\delta_{11} = \delta_{00} = \delta_{y0} = \delta_{trt} = 0$, and further for (B') let $V$ be the variables listed
in Table 1. The lower panel of Table 3 summarizes the bound estimates of the ATE
and $\tau$. The bound estimate of the ATE under (A), $(-196, 256)$, is not informative and is
much wider than those under (B), $(20, 231)$, and (B'), $(18, 218)$. Assumption (B') is not
necessarily stronger than (B), but imposing observed-data models and smoothing the
estimated bounds over covariates yields tighter bounds. To obtain
uncertainty regions on these bound estimates, we draw $K = 1{,}000$ bootstrap samples,
fixing the number of patients at the two types of study sites (academic medical centers
versus community clinics). Because the bound estimates involve the min() operation,
which complicates the derivation of their standard errors, we use the $K$ bootstrapped
samples to calculate the standard errors of the two ends of the bound estimates. Table 3
summarizes the point-wise, strong, and bootstrap strong 95% coverage URs. The
difference between the 95% CI of the IPW estimate and the 95% URs under (B) and (B')
can be regarded as the bias of the IPW estimate due to unmeasured confounding. The
results suggest that unmeasured confounding tends to cause a downward bias and that the
true ATE is likely to be higher than what the IPW 95% CI suggests.
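
The resampling scheme described above, in which the number of patients at each type of study site is held fixed, corresponds to a stratified bootstrap. A minimal sketch follows, assuming records are stored as dictionaries and that the bound estimator is supplied by the user; `stratified_bootstrap`, `stratum_of`, `estimate_bounds`, and the toy estimator are hypothetical names introduced only for illustration.

```python
import random
from statistics import stdev


def stratified_bootstrap(records, stratum_of, estimate_bounds, K=1000, seed=0):
    """Resample with replacement within each stratum, keeping stratum sizes fixed.

    estimate_bounds(sample) must return a pair (lower, upper).
    Returns the K bootstrap bound estimates and the bootstrap standard
    errors of the lower and upper ends.
    """
    rng = random.Random(seed)
    strata = {}
    for r in records:
        strata.setdefault(stratum_of(r), []).append(r)

    draws = []
    for _ in range(K):
        sample = []
        for members in strata.values():
            # Same number of records drawn from each site type as in the data.
            sample.extend(rng.choices(members, k=len(members)))
        draws.append(estimate_bounds(sample))

    se_lower = stdev(d[0] for d in draws)
    se_upper = stdev(d[1] for d in draws)
    return draws, se_lower, se_upper


def toy_bounds(sample):
    """Toy stand-in for a bound estimator: (min, max) of the site-specific means."""
    means = []
    for site in (0, 1):
        ys = [r["y"] for r in sample if r["z"] == site]
        means.append(sum(ys) / len(ys))
    return min(means), max(means)


toy_records = ([{"z": 0, "y": float(v)} for v in range(30)]
               + [{"z": 1, "y": float(v + 5)} for v in range(20)])
toy_draws, toy_se_lower, toy_se_upper = stratified_bootstrap(
    toy_records, lambda r: r["z"], toy_bounds, K=200, seed=1)
```

The paired draws returned here are the inputs needed both for the bootstrap standard errors used in the CAN-based URs and for the bootstrap strong UR of Section 5.5.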

The bound estimates on $\tau$ under (B) and (B') are $(-204, 7.5)$ and $(-191, 9.1)$, which
are much tighter and more informative than the bound estimate under (A), $(-229, 223)$.
The 95% URs on $\tau$ are listed in Table 3. A possibly negative value of $\tau$ implies that
unmeasured factors resulted in preferential prescription of HAART to those with lower
CD4 counts in the HERS, and that those on HAART might have had CD4 counts up to 200
cells/mm$^3$ lower (if left untreated) on average than those not on HAART.

6.3 Sensitivity to Unknown Parameters 

In this section, we conduct a simple sensitivity analysis for the unknown parameters
used in the three sets of assumptions. We impose a common upper limit $\xi = \xi_0 = \xi_1$
for Assumption (A) and let $\xi$ vary from 300 to 500. The bound estimates on the ATE
and $\tau$ along with bootstrap strong 95% URs are shown in Figure 1 (first row). For the
considered range, $\xi$ has more influence on the upper (lower) bound estimate on ATE
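Table 2: Estimated bounds on ATE and $\tau$ under Assumptions (A), (B) and (B').

Assumption   Parameter   Estimated bound
(A)          ATE         $[\delta(\xi_0, 0),\ \delta(0, \xi_1)]$
             $\tau$      $[\hat{\tau}_n(\delta(0, \xi_1), \gamma),\ \hat{\tau}_n(\delta(\xi_0, 0), \gamma)]$
(B)          ATE         $[\delta(c_0, \mu_{00}(0) + \delta_{00}),\ \delta(0, c_1)]$
             $\tau$      $[\hat{\tau}_n(\delta(0, c_1), \gamma),\ \hat{\tau}_n(\delta(c_0, \mu_{00}(0) + \delta_{00}), \gamma)]$
(B')         ATE         $[\pi_{01}\Delta_{IV} + \cdots,\ \pi_{01}\Delta_{IV} + \cdots]$
             $\tau$      $[\hat{\tau}_n\{\pi_{01}\Delta_{IV} + \cdots\},\ \hat{\tau}_n\{\pi_{01}\Delta_{IV} + \cdots\}]$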


Table 3: Estimates of HAART treatment effect on CD4 and $\tau$. The 95% confidence
intervals (CI) for point estimates and 95% uncertainty regions for bound estimates
(UR_p-can, UR_s-can, and UR_s-bts; see Section 5.5) are shown. We
assume that $\xi_0 = \xi_1 = 500$ for (A); and that $\delta_{00} = \delta_{11} = \delta_{y0} = \delta_{trt} = 0$ for (B) and
(B').

                                  ATE             $\tau$
AT      Point estimate            13
        95% CI                    (-30, 56)
IPW     Point estimate            27
        95% CI                    (-16, 70)
IV      Point estimate            207
        95% CI                    (-250, 664)
Assumption:
(A):    Bound estimate            (-196, 256)     (-229, 223)
        95% UR_p-can              (-229, 289)     (-269, 266)
        95% UR_s-can              (-235, 295)     (-277, 274)
        95% UR_s-bts              (-233, 294)     (-274, 273)
(B):    Bound estimate            (20, 231)       (-204, 7.5)
        95% UR_p-can              (-9, 280)       (-260, 49)
        95% UR_s-can              (-15, 289)      (-271, 57)
        95% UR_s-bts              (-14, 285)      (-270, 57)
(B'):   Bound estimate            (18, 218)       (-191, 9.1)
        95% UR_p-can              (-10, 261)      (-234, 48)
        95% UR_s-can              (-16, 270)      (-243, 56)
        95% UR_s-bts              (-14, 270)      (-243, 56)



(on $\tau$), and the resulting bound estimates remain wide and non-informative.

For (B) and (B'), we let $\delta_{00}$ (the lower limit of treatment effect among $\mathcal{P}_{00}$) range
from $-60$ to 20 and fix $\delta_{11} = \delta_{y0} = \delta_{trt} = 0$. (More sophisticated sensitivity analyses
that jointly evaluate $\delta_{11}$, $\delta_{00}$, $\delta_{y0}$ and $\delta_{trt}$ are possible.) We choose this range for
$\delta_{00}$ based on the magnitude of the AT and IPW estimates, and tilt it toward
the negative side to allow for the possibility that HAART could be harmful for those never
receiving HAART. Figure 1 (second and third rows) shows that $\delta_{00}$ only affects the
lower (upper) bound estimates of the ATE (of $\tau$). The ATE can be as high as over 200
cells/mm$^3$, and the lower bound of the ATE varies around zero depending on the value of
$\delta_{00}$. Again, a possibly negative value of $\tau$ suggests that unmeasured confounding likely
causes HAART to be preferentially prescribed to those with poorer health.

7 Discussions 

In our study of the HERS, we propose to use an IV and sets of contextually plausible
assumptions to quantify the causal effect of a treatment as well as the degree of
unmeasured confounding. We consider three sets of assumptions. Assumption (A) specifies
the limits of the expected unobservable potential outcomes, which leads to a simplified
version of the Robins-Manski bounds on the ATE. Assumptions (B) and (B') specify the
relative magnitudes between identified and unidentified potential outcome averages,
and lead to bounds that can be much tighter than those under (A). The 95% uncertainty
regions for the ATE under (B) and (B') are more informative than the 95% CI of the IV
estimate, and are less subject to unmeasured confounding bias than
the 95% CI of the IPW estimate. The bound estimates on the ATE and $\tau$ reveal that
unmeasured confounding could cause a downward bias on the ATE because HAART
was preferentially prescribed to those in poorer health.

Quantifying the degree of unmeasured confounding can be valuable for the analysis
of studies conducted in similar settings but having no IV. Several HIV observational
studies (e.g. Gange et al., 2007) were conducted contemporaneously with the HERS,
and could suffer from a similar amount of unmeasured confounding. In those studies,
when unmeasured confounding is of concern, analyses should be complemented with a
sensitivity analysis of the kind described in this paper. A plausible range for $\tau$ can
be informed by our study.

In this paper, we use the type of study site as an instrumental variable, assuming
that two crucial IV assumptions (monotonicity and exclusion restriction) are satisfied.
The observed HAART assignment rate at academic centers is higher than that at
community clinics. Such an observation suggests that the deterministic monotonicity
$\Pr(A_1 \ge A_0) = 1$ is plausible, but this assumption cannot be verified. As one limitation
of our study, this assumption will be violated if some individuals would receive
HAART at community clinics but not at academic medical centers. If the proportion
of these individuals ($\mathcal{P}_{10}$) is small, the bias due to the violation of the monotonicity
assumption is probably negligible. Alternatively, one can assume that $\mathcal{P}_{00}$ is absent,

Figure 1: Sensitivities of bound estimates to $\xi = \xi_0 = \xi_1$ under Assumption (A); to
$\delta_{00}$ under Assumptions (B) and (B'). The gray zones show the bound estimates as a
function of $\xi$ or $\delta_{00}$. The bootstrap strong 95% UR_s-bts's are shown as dashed lines.

so that $\mathcal{P}_{11}$, $\mathcal{P}_{01}$ and $\mathcal{P}_{10}$ form a partition of the population. This assumption
allows everyone to have some chance of receiving HIV therapy, which is also sensible
for the HERS because these patients' CD4 counts were less than 350 six months earlier,
and allows for the possibility that some people would potentially be treated at a
community clinic but not an academic medical center. With this assumption, the
proportions of $\mathcal{P}_{11}$, $\mathcal{P}_{01}$ and $\mathcal{P}_{10}$ are identified because $\pi_{01} = \Pr(A = 0 \mid Z = 0)$,
$\Pr(\mathcal{P}_{10}) = \Pr(A = 0 \mid Z = 1)$, and $\Pr(\mathcal{P}_{11}) = 1 - \pi_{01} - \Pr(\mathcal{P}_{10})$. The following
estimands are also identified: $E\{Y(0) \mid \mathcal{P}_{01}\} = E(Y \mid A = 0, Z = 0)$,
$E\{Y(0) \mid \mathcal{P}_{10}\} = E(Y \mid A = 0, Z = 1)$,
$E\{Y(1) \mid \mathcal{P}_{11} \text{ or } \mathcal{P}_{01}\} = E(Y \mid A = 1, Z = 1)$, and
$E\{Y(1) \mid \mathcal{P}_{11} \text{ or } \mathcal{P}_{10}\} = E(Y \mid A = 1, Z = 0)$.
A challenge here is how to incorporate the IV estimator, whose estimand is now
a 'weighted' contrast of the average treatment effects between $\mathcal{P}_{01}$ and $\mathcal{P}_{10}$, to construct
a constraint similar to (1). This may be worth further investigation.
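
The identification results listed above translate directly into sample proportions and cell means. The sketch below is only an illustration of that arithmetic under the alternative assumption that the never-treated stratum is absent; the function name, the dictionary keys, and the coding of the instrument (`z = 1` for academic medical centers, `z = 0` for community clinics) are assumptions made for the example.

```python
def strata_without_never_takers(a, z, y):
    """Plug-in estimates when the never-treated stratum is assumed absent.

    a: treatment received (0/1), z: instrument (0/1), y: outcome.
    Every (a, z) cell is assumed to be non-empty.
    """
    def cell(a_val, z_val):
        return [yi for yi, ai, zi in zip(y, a, z) if ai == a_val and zi == z_val]

    def mean(xs):
        return sum(xs) / len(xs)

    n_z0 = sum(1 for zi in z if zi == 0)
    n_z1 = len(z) - n_z0

    pi_01 = len(cell(0, 0)) / n_z0   # Pr(A = 0 | Z = 0)
    pi_10 = len(cell(0, 1)) / n_z1   # Pr(A = 0 | Z = 1)
    pi_11 = 1 - pi_01 - pi_10        # remainder of the partition

    return {
        "pi_01": pi_01,
        "pi_10": pi_10,
        "pi_11": pi_11,
        "EY0_P01": mean(cell(0, 0)),         # E(Y | A = 0, Z = 0)
        "EY0_P10": mean(cell(0, 1)),         # E(Y | A = 0, Z = 1)
        "EY1_P11_or_P01": mean(cell(1, 1)),  # E(Y | A = 1, Z = 1)
        "EY1_P11_or_P10": mean(cell(1, 0)),  # E(Y | A = 1, Z = 0)
    }


# Toy data (hypothetical values).
toy = strata_without_never_takers(
    a=[0, 0, 1, 1, 0, 1, 1, 1],
    z=[0, 0, 0, 0, 1, 1, 1, 1],
    y=[1, 3, 5, 7, 2, 4, 6, 8],
)
```

A negative plug-in value for `pi_11` would indicate that the data are incompatible with the assumed partition.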

Moreover, replacing the deterministic monotonicity with a stochastic monotonicity
assumption deserves future exploration. Roy et al. (2008) assumed
$\Pr(A_1 = 1 \mid A_0 = 1, V) \ge \Pr(A_1 = 1 \mid A_0 = 0, V)$, and proposed to use auxiliary covariates to
estimate the memberships of principal strata. Small and Tan (2008) assumed
$\Pr(A_1 = 1 \mid U) \ge \Pr(A_0 = 1 \mid U)$, with $U$ being a latent variable satisfying certain conditions.
These stochastic monotonicity assumptions allow the possible presence of $\mathcal{P}_{10}$ and may
be more realistic in the HERS than the deterministic monotonicity.

The exclusion restriction could also be violated if the type of study site $Z$ remains
associated with the outcome $Y$ after accounting for the effect of $Z$ on HAART receipt.
A weaker exclusion restriction assumption can be made if the association between
the instrument and the outcome can be removed after conditioning on some measured
covariate $V^{*}$, i.e. $\{Y(1), Y(0)\} \perp Z \mid V^{*}$. In this case, the methods by Tan (2006) can
be implemented for identifying the LATE, and our method for bound estimation on
the ATE and $\tau$ still applies.

There are several ways to account for the measured confounding. We use the method
of inverse probability weighting by specifying a propensity score model. Alternatively,
we can specify both an outcome regression model and a propensity score model and use
the doubly robust (DR) estimator (Bang and Robins, 2005) to estimate the ATE. We do
not implement the DR estimator in this paper because, when unmeasured confounding
exists, the DR estimator is no longer guaranteed to be consistent for the ATE and could
suffer more bias than other estimators. The simulations of Kang and Schafer (2007)
suggest that IPW is relatively robust to the impact of unmeasured confounding in terms
of estimation bias. Because the focus of this paper is unmeasured confounding,
we use IPW for estimating the ATE.